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(54) Apparatus and method for motion vector estimation with high speed 



(57) In a motion vector estimating apparatus, a cur- 
rent picture storage unit (1 ) stores image data of a cur- 
rent picture, and a reference picture storage unit (2) 
stores image data of a reference picture. A search win- 
dow determining unit (4) determines estimation history 
from previously estimated motion vectors, and deter- 
mines a search window based on the estimation history. 
At least one of a shape, size and position of the search 



window is determined based on the estimation history. 
The search window is composed of rectangular refer- 
ence regions. A block matching circuit (300-1 , 300-2) for 
performing a block matching process to a current block 
and each of reference blocks of the search window to 
determine a motion vector. The search window may be 
limited in units of pixels, or a load of the apparatus, a 
power supply voltage or a temperature of the block 
matching circuit. 
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Description 

Background ol the Invention 

s 1. Field of the Invention 

The present invention is related to an apparatus and a method for estimating a motion vector utilized to compensate 
for motion of an image in a prediction coding process of a moving picture. 

io 2. Description of the Related Art 

In a conventional a method and apparatus for estimating a motion vector estimating method, a current picture to 
which a prediction coding process is performed is divided into a plurality of blocks, for example, to have 16 pixels in a 
horizontal direction and 1 6 pixels in a vertical direction. These blocks are referred to as current blocks. A search region 

is is predetermined in a reference picture for each of the above-described current blocks to have as a center of the search 
region the positions corresponding to the horizontal position and the vertical position of a pixel position of each current 
block. The predetermined search region in the reference picture is referred to as a search window A block matching 
process is carried out between a specific one of the current blocks and the search window corresponding to the specific 
current block to estimate a position on which a predetermined evaluation function has the minimum value for obtaining 

2Q a motion vector. The conventional motion vector estimating apparatus is directed to a purpose that the motion vector 
can be more correctly obtained. 

A first conventional example of a motion vector estimating apparatus is disclosed in, for example, Japanese Laid 
Open Patent Application (JP-A-Showa 61 -201 583). In the reference, a limit value is provided to limit a reference region 
used to search the motion vector in accordance with a change amount of a motion vector detected in a previous frame 

25 to a current frame. When the motion vector change amount exceeds the limit value, this limit value is outputted. 

Also, a second conventional example of the motion vector estimating apparatus is proposed in Japanese Laid 
Open Patent Application (JP-A-Heisei 5-328333), in which several search windows or regions for a motion vector are 
prepared to be different from each other in position. One of the several reference windows or regions is selected in 
accordance with the change amounts of the motion vectors detected in a previous frame to the current frame. 

30 Further, a third conventional example of the motion vector estimating apparatus is proposed in Japanese Laid 

Open Patent Application (JP-A-Heisei 7-203457), in which an accumulating process of absolute values of differences 
is executed in a parallel manner in the block matching process. As a result, the capacity of a buffer for signals used in 
the accumulation is reduced so as to increase the processing speed. 

Furthermore, a fourth conventional example of the motion vector estimating apparatus is proposed in Japanese 

35 Laid Open Patent Application (JP-A-Heisei 7-250328), in which motion vectors used to compensate for the moving 
image prediction are estimated in accordance with a plurality of prediction modes with a small hardware amount. 

There are the following problems in the above-described conventional examples of the motion vector estimating 
techniques. 

(1 ) The first problem is in the following point. That is, power consumption cannot be reduced in the first conventional 
40 example of the motion vector estimating apparatus in which the limit value is provided to limit the search region for the 

motion vector. This is because in the motion vector estimating apparatus described in Japanese Laid Open Patent 
Application (JP-A-Showa 61-201583), the limit value is only introduced to the reference region. The calculation time 
and the calculation amount for the block matching process to be executed cannot be reduced in actual. In other words, 
since the calculation time and the calculation amount cannot be reduced, the power consumption of the motion vector 

45 estimating apparatus cannot be reduced. 

In the block matching process as described in, for instance, Japanese Laid Open Patent Application (JP-A-Heisei 
7-203457) and Japanese Laid Open Patent Application (JP-A-Heisei 7-250328), the operation time is reduced by 
employing parallel processing. However, the reason why the operation time cannot be reduced is that if the limit value 
is provided as in the motion vector estimating apparatus described in Japanese Laid Open Patent Application (JP-A- 

so Showa 61 -201 583), the parallel degree of the above-described parallel processing decreases. As a result, the operation 
time cannot be reduced. 

(2) The second problem is in that the correct motion vector cannot be obtained in the conventional motion vector 
estimating apparatus as proposed in Japanese Laid Open Patent Application (JP-A-Heisei 5-328333), in which the 
several reference regions for the motion vector are prepared. This is because in the method for selecting the optimum 

55 reference region from the prepared reference regions, when pictures are such complex that more than one object 
move, the object moving direction cannot be exclusively determined. As a result, the optimum reference region cannot 
be selected from the prepared several reference regions. 

Also, in the above-described motion vector estimating apparatus described in Japanese Laid Open Patent Appli- 
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cation (JP-A-Heisei 5-328333), if the size and/or shape of the reference region cannot be previously determined, then 
the parallel degree of the apparatus for executing the block matching process would be lowered. As a consequence, 
since the shape of the reference region is limited and thus the proper reference region cannot be utilized, the correct 
motion vector cannot be obtained. 

5 (3) The third problem is in that in the motion vector estimating apparatus proposed in Japanese Laid Open Patent 

Application (JP-A-Heisei 7-203457), the difference absolute value accumulating process is carried out in the parallel 
manner using the small buffer for signals for the calculation. However, in the motion vector estimating apparatus, the 
hardware amount of the block matching circuit for executing the difference absolute value accumulating process in the 
parallel manner increases so that the chip area for the block matching circuit is increased as well as the power con- 

10 sumption is increased. This is because in the motion vector estimating apparatus as described in Japanese Laid Open 
Patent Application (JP-A-Heisei 7-203457) and Japanese Laid Open Paten Application (JP-A-Heisei 7-250328), a large 
amount of wiring lines must be provided for a processing element as the minimum unit of the block matching circuit. 
Thus, the valid data can be continuously entered into the respective processing elements such that the parallel degree 
is increased. As a consequence, a chip area required for the wiring lines increases considerably. 

'5 Also, in the above-explained motion vector estimating apparatus described in Japanese Laid Open Patent Appli- 

cation (JP-A-Heisei 7-203457) and Japanese Laid Open Patent Application (JP-A-Heisei 7-250328), there is another 
problem. That is, the minimum value determining unit determines the minimum value of a summation of absolute values 
of differences accumulated values in the respective processing elements, and the chip area required for wiring lines 
is considerably increased, because a total number of wiring lines is very large which are used to be connected from 

20 the minimum value determining unit. 

(4) The fourth problem is in that if the size and/or shape of the reference region is allowed to be varied in the motion 
vector estimating apparatus in which the difference absolute value accumulating process is executed in the parallel 
manner, the hardware amount of the block matching circuit for executing the difference absolute value accumulation 
process in the parallel manner is increased. As a result, the chip area increases and also the power consumption 

25 increases. Also, at this time, the parallel degree is reduced. This is because the large amount of wiring lines must be 
provided to continuously enter the valid data into the processing elements as the minimum unit for constituting the 
block matching circuit. 

Also, it is because the data entering sequence of the processing element is greatly changed, depending upon the 
sizes and the shapes of the reference regions. As a result, the process for reading out the reference data from a storage 
so unit becomes complex, so that the chip area increases and the power consumption increases. 

(5) The fifth problem is in that the chip area of the block matching circuit increases in the motion vector estimating 
apparatus in which the motion vector is estimated in a plurality of prediction modes. This is because a large number 
of wiring lines is required to connect the minimum value determining unit for determining the minimum value of the 
difference absolute value accumulated values to the respective processing elements in the motion vector estimating 

35 apparatus described in Japanese Laid Open Patent Application (JP-A-Heisei 7-250328). As a result, the chip area 
required for the wiring lines is considerably increases. 

Also, the stage number of FIFOs (first-in first-out) required for the block matching circuit is increased. 

Summary of the Invention 

40 

As a consequence, the present invention has been made to solve the above-described problems. Therefore, an 
object of the present invention is to provide a motion vector estimating method and a motion vector estimating apparatus 
which can estimate a motion vector in high precision with low power consumption and a saved chip area. 

Another object of the present invention is to provide a motion vector estimating method and a motion vector esti- 
45 mating apparatus, in which when a parallel processing is applied to a block matching process, the parallel degree 
thereof is not reduced. 

Still another object of the present invention is to provide a motion vector estimating method and a motion vector 
estimating apparatus, in which motion vectors of a plurality of prediction modes can be estimated while saving the chip 
area of a block matching circuit. 
50 Yet still another object of the present invention is provide a motion vector estimating apparatus, which can be 

realized with the saved chip area of a block matching circuit regardless of the dimension and shape of a reference 
region. 

In order to achieve an aspect of the present invention, a motion vector estimating apparatus includes a current 
picture storage unit which stores image data of a current picture, a reference picture storage unit which stores image 
55 data of a reference picture, a search window determining unit which determines estimation history from previously 
estimated motion vectors, and determines a search window based on the estimation history, wherein the search window 
is composed of rectangular reference regions, a block matching circuit for performing a block matching process to a 
current block and each of reference blocks of the search window to determine a motion vector, and a control circuit 
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which reads out the image data of the current block from the current picture storage unit to supply the block matching 
circuit, which reads out the image data of each of the reference blocks of the search window from the reference picture 
storage unit to supply to the block matching circuit, and which controls the block matching circuit such that the block 
matching process is performed to the current block and the each reference block of the search window. 

s At least one of a shape of the search window, a size of the search window and a position of the search window is 

determined based on the estimation history. 

The search window determining unit determines whether the number of previously estimated motion vectors ex- 
ceeds a predetermined number. When it is determined that the number of previously estimated motion vectors is less 
than the predetermined number, the search window determining unit neglects the previously estimated motion vectors 

io and determines optimal size and shape of the search window. On the other hand, when it is determined that the number 
of previously estimated motion vectors exceeds the predetermined number, the search window determining unit pro- 
duces a 2-dimensional map in which each of the previously estimated motion vectors is mapped based on its horizontal 
value and the vertical value. Thus, the search window determining unit determines optimal size and shape of the search 
window such that the search window includes the history of the previously estimated motion vectors. Also, the search 

is window determining unit calculates average values of changes of the previously estimated motion vectors in the hor- 
izontal and vertical directions, respectively, and determines an optimal position of the search window such that, for 
example, a center position of the search window is shifted based on the average values. In this manner, the size, shape 
and position may be independently determined. On the contrary, the size, shape and position may be collectively 
determined. 

20 The motion vector estimating apparatus may further include a load detecting circuit for detecting either one of a 

load of the motion vector estimating apparatus or a load of the block matching circuit. In this case, the search window 
determining unit determines the search window based on the estimation history and the detected load. Also, the motion 
vector estimating apparatus may further include a voltage detecting circuit for detecting a power supply voltage. In this 
case, the search window determining unit determines the search window based on the estimation history and the 

25 detected power supply voltage. Further, the motion vector estimating apparatus may further include a temperature 
detecting circuit for detecting a temperature of a chip on which the block matching circuit is installed. In this case, the 
search window determining unit determines the search window based on the estimation history and the detected tem- 
perature. In addition, the search window determining unit may determine at least an unuseful region in the search 
window in unit of pixels based on the estimation history, and set a flag to each of pixels of the unuseful region such 

30 that the block matching process to the reference block which includes any pixel of the unuseful region is made invalid 
based on the flag of the pixel of the unuseful region. 

The block matching circuit includes processing elements of M columns and N rows (M and N are positive integers) 
provided such that a likelihood between the current block and each of the reference blocks is calculated through pipeline 
processing. In this case, the block matching circuit may further include a hierarchical selector structure such that each 

35 of the processing elements is supplied with one of the pixel data of each of the reference blocks. Also, the block 
matching circuit may further include (M -t- N) reference data buses, which are provided for the M columns and the N 
rows, and N current data buses provided for the N rows, respectively. In this case, each of the (M + N) reference data 
buses transfers one pixel data of any of the reference blocks in a unit time of the pipeline processing. The current block 
is composed of M columns and N rows. 

40 The processing elements may be cascade-connected in each of the N rows and the N rows may be cascade - 

connected. 

In another case, the processing elements may be cascade-connected in each of the N rows (N is an even number). 
Odd-numbered ones of the N rows in an upper half are cascade-connected via first FIFO units, each of which is provided 
for two adjacent odd-numbered rows, wherein even-numbered ones of the N rows in the upper half are cascade- 

45 connected via second FIFO units, each of which is provided for two adjacent even-numbered rows, wherein odd- 
numbered ones of the N rows in a lower half are cascade-connected via third FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and wherein even-numbered ones of the N rows in the lower half are cascade- 
connected via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows. Further, the block 
matching circuit further includes fifth and sixth FIFO units connected to last stages of processing elements in the 

so cascade connections for the odd-numbered rows and the even-numbered rows in the upper half, respectively. In this 
case, a number of stages in each of the first to fourth FIFO units is M, and a number of stages in the fifth FIFO unit is 
(M x N/2 + M), and a number of stages in the sixth FIFO unit is (M x N/2). 

In order to achieve another aspect of the present invention, a motion vector estimating apparatus includes process- 
ing elements of M columns and N rows (M and N are positive integers) , wherein the processing elements are cascade- 

55 connected in each of the N rows, wherein each of the processing elements calculates an absolute value of a difference 
between a corresponding pixel data of a current block and one pixel data of each of reference blocks, and adds the 
absolute value to an output from a previous stage of processing element to output the adding result to a next stage of 
processing element, and wherein the N rows are cascade-connected to allow pipeline processing, M reference data 
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column buses provided for the M columns, respectively, N reference data row buses provided for the N rows, respec- 
tively, (M x N) selectors, which are respectively provided for the processing elements to allow one of the pixel data on 
the corresponding one of the M reference data column buses and the pixel data on the corresponding one of the N 
reference data row buses to the corresponding processing element as the reference block pixel data, and a current 

s block pixel data bus for transferring current block pixel data to each of the processing elements. 

The motion vector estimating apparatus may further includes a reference block data bus group for transferring 
pixel data of the reference blocks in units of rows as sequences of pixel data, a first group of selectors which are 
respectively provided for the M reference data column buses, and each of which selects one pixel data of one of the 
sequences to supply to the corresponding reference data column bus, and a second group of selectors which are 

10 respectively provided for the N reference data row buses, and each of which selects one pixel data of one of the 
sequences to supply to the corresponding reference data row bus. In this case, each of the processing elements latches 
the corresponding current block pixel data once the corresponding current block pixel data is supplied via the current 
block pixel data bus. Also, each of the processing elements is synchronized with a clock signal to perform the calculation 
of the absolute value of the difference and the addition and to output the addition result to the next stage of processing 

is element. Thus, the processing elements calculates a likelihood of one of the reference blocks in a time period from a 
first time to (M x N) time, and calculates a likelihood of a different one of the reference blocks in a time period from a 
second time to (M x N + 1 ) time. 

The current block is composed of pixels of M columns and N rows. 

In a case, the processing elements are cascade-connected in each of the N rows and the N rows are cascade- 
20 connected. 

In another case, the processing elements are cascade-connected in each of the N rows (N is an even number). 
In addition, odd-numbered ones of the N rows in an upper half are cascade-connected via first FIFO units, each of 
which is provided for two adjacent odd-numbered rows, even-numbered ones of the N rows in the upper half are 
cascade-connected via second FIFO units, each of which is provided for two adjacent even-numbered rows, odd- 

25 numbered ones of the N rows in a lower half are cascade-connected via third FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and even-numbered ones of the N rows in the lower half are cascade-connected 
via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows. Also, the block matching circuit 
further includes fifth and sixth FIFO units connected to last stages of processing elements in the cascade connections 
for the odd-numbered rows and the even-numbered rows in the upper half, respectively. In this case, a number of 

30 stages in each of the first to fourth FIFO units is M, and a number of stages in the fifth FIFO unit is (M x N/2 + M), and 
a number of stages in the sixth FIFO unit is (M x N/2). The motion vector estimating apparatus may further include a 
fist adder adding an output of the fifth FIFO unit and an output of the sixth FIFO unit, a second adder adding the output 
of the fifth FIFO unit and an output of the processing element in a last stage of odd-numbered cascade connection in 
the lower half, a third adder adding the output of the sixth FIFO unit and an output of the processing element in a last 

35 stage of even-numbered cascade connection in the lower half, a fourth adder adding the output of the processing 
element in the last stage of odd-numbered cascade connection in the lower half and the output of the processing 
element in the last stage of even-numbered cascade connection in the lower half, and a fifth adder adding an output 
of the second adder and an output of the third adder. 

In still another case, the processing elements are cascade-connected in each of the N rows (N is an even number). 

40 Also, ones of the N rows in an upper half are cascade-connected via first FIFO units, each of which is provided for two 
adjacent rows, and ones of the N rows in a lower half are cascade-connected via second FIFO units, each of which is 
provided for two adjacent rows. In addition, the block matching circuit further includes a third FIFO units connected to 
the processing element in a last stage of cascade connection in the upper half. In this case, a number of stages in 
each of the first and second FIFO units is M, and a number of stages in the third FIFO unit is (M x N/2). The motion 

45 vector estimating apparatus may further include an adder adding an output of the third FIFO unit and an output of the 
processing element in a last stage of cascade connection in the tower half. 

In yet still another case, the processing elements are cascade-connected in each of the N rows (N is an even 
number). Also, odd-numbered ones of the N rows are cascade -connected via first FIFO units, each of which is provided 
for two adjacent odd-numbered rows, and even-numbered ones of the N rows are cascade-connected via second FIFO 

so units, each of which is provided for two adjacent even-numbered rows, fn this case, a number of stages in each of the 
first and second FIFO units is M. The motion vector estimating apparatus may further include an adder adding an 
output of the processing element in a last stage of odd-numbered cascade connection and an output of the processing 
element in a last stage of even-numbered cascade connection. 

In order to achieve still another aspect of the present invention, a method of estimating a motion vector, includes 

55 the steps of: 

determining estimation history from previously estimated motion vectors; 

determining a search window based on the estimation history, wherein the search window is composed of rectan- 
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gular reference regions; and 

performing a block matching process to a current block and each of reference blocks of the search window to 
determine a motion vector. 

5 Brief Description of the Drawings 

Fig. 1 is a block diagram illustrating of the structure of a motion vector estimating apparatus according to a first 
embodiment of the present invention; 

Fig. 2 is a block diagram illustrating the structure of a block matching unit in the motion vector estimating apparatus 
10 in the first embodiment shown in Fig. 1 ; 

Fig. 3 is a connection diagram for explaining connections of a processing element in the block matching unit in 
the first embodiment shown in Fig. 2; 

Fig. 4 is a block diagram illustrating the internal structure of a processing element shown in Fig. 3; 

Fig. 5 is a block diagram illustrating the structure of a motion vector monitoring unit of the motion vector estimating 
is apparatus in the first embodiment shown in Fig. 1 ; 

Fig. 6A is a diagram for explaining a search window which is formed in such a manner that a plurality of rectangular 

shapes are synthesize, in the first embodiment of the present invention; 

Fig. 6B is a partial expanded diagram for explaining the search window shown in Fig. 6A; 

Fig. 7 is a diagram for explaining the relation of a previous picture, a search window including reference blocks 
20 and a current block of a current picture; 

Figs. 8A and 8B are schematic diagrams for explaining a block matching process operation of the block matching 

unit in the first embodiment of the present invention; 

Figs. 9A to 9F are schematic diagrams for explaining the block matching process operation of the block matching 
unit and the supply and selection of a reference block image data in the first embodiment of the present invention; 
25 Fig. 10 are schematic diagrams for explaining the supply and selection of a reference block image data in the first 

embodiment of the present invention; 

Fig. 11 is a block diagram for explaining a block matching circuit in a second embodiment of the present invention; 
Fig. 12 is a block diagram for explaining a block matching circuit in a third embodiment of the present invention; 
Fig. 1 3 is a diagram for explaining the supply and selection of a reference block image data in the third embodiment 
30 of the present invention shown in Fig. 12; 

Fig. 14 is a diagram illustrating the structure of the motion vector monitoring unit in a fourth embodiment of the 
present invention; 

Fig. 15 is a diagram to explain the operation of the motion vector monitoring unit in the fourth embodiment shown 
in Fig. 14; 

35 Fig. 16 is a block diagram illustrating the internal structure of a processing element of the block matching circuit 

in the fourth embodiment; and 

Fig. 17 is a block diagram illustrating the structure of the motion vector estimating apparatus according to a fifth 
embodiment of the present invention. 

40 Description of the Preferred Embodiments 

A motion vector estimating apparatus of the present invention will be described below in detail with reference to 
the accompanying drawings. 

Fig. 1 is a block diagram illustrating the structure of the motion vector estimating apparatus according to the first 
45 embodiment of the present invention. Referring now to Fig. 1 , the motion vector estimating apparatus in the first em- 
bodiment is composed of a current picture data storage unit 1 , a reference picture data storage unit 2, a block matching 
unit 3, a motion vector monitoring unit 4, and a motion vector storage unit 5. 

The current picture data storage unit 1 stores image data of a current picture and pixel data C for pixels of a current 
block are read out from the storage unit 1 onto a signal line 101 by the block matching unit 3. The reference picture 
so data storage unit 2 stores image data of a reference picture and pixel data for pixels of a reference block within a 
predetermined search window are read out from the storage unit 2 onto a signal line 121 by the block matching unit 
3. The reference picture may be a previous picture to the current picture, or a different picture useful for estimating the 
motion vector. 

The block matching unit 3 reads out the current block pixel data from the current picture data storage unit 1 and 
55 the reference block picture data from the reference picture data storage unit 2 for a block matching process. Then, the 
block matching unit 3 performs an operation for the block matching process indicated by the following equation (1 ) to 
the current block pixel data and the reference block pixel data. The block matching process is performed ail the refer- 
ence blocks of the search window. 
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D{k,m) - V J \R(k + i - l,m + j - 1)C(/, ;)| equation ( 1 ) 



where C- ■ is one pixel data of the current block, Rj l is one pixel data of the current block, and M and N are the number 
of columns and the number of rows in the current block, respectively. The reference block having the maximum likeh- 
hood, i e , a minimum value as a result of the block matching process operation is determined for estimating a motion 



vector 

w The motion vector storage unit 5 stores previously estimated motion vectors. The motion vector monitoring unit 4 

stores the motion vector outputted from the block matching unit 3 in the motion vector storage unit 5. Also, the motion 
vector monitoring unit 4 outputs a search window designation data to the block matching unit 3 based on the motion 
vectors stored in the storage unit 5. In other words, the motion vector monitoring unit 4 determines estimation history 
of the motion vectors based on the motion vectors stored in the motion vector storage unit 5. The motion vector mon- 
is itoring unit 4 determines the size, shape and position of the search window based on the determined estimation history. 
That is the motion vector monitoring unit 4 refers to the motion vector storage unit 5 to determine whether the number 
of previously estimated motion vectors exceeds a predetermined number. When it is determined that the number of 
previously estimated motion vectors is less than the predetermined number, the motion vector monitoring unit 4 neglects 
the previously estimated motion vectors and determines optimal size and shape of the search window based on the 
20 current block When it is determined that the number of previously estimated motion vectors exceeds the predetermined 
number the motion vector monitoring unit 4 produces a a-dimensional map in which each of the previously estimated 
motion vectors is mapped based on its horizontal and the vertical values. Thus, the motion vector monitoring unit 4 
determines optimal size and shape of the search window such that the search window includes the history of the 
previously estimated motion vectors. Also, the motion vector monitoring unit 4 calculates average values of changes 
25 of the previously estimated motion vectors in the horizontal and vertical directions, respectively, and determines an 
optimal position of the search window such that, for example, a center position of the search window is shifted based 
on the average values. In this manner, the size, shape and position may be independently determined. On the contrary, 
the size, shape and position may be collectively determined. 

The motion vector monitoring unit 4 outputs the search window designation data 1 61 to the block matching unit 3. 
so Thus the size, shape and position of the search window may be changed in response to the search window designation 
data for every frame, tor every block, or if necessary. Therefore, the search window determining process is not a bottle 
neck to restrict the processing speed of the block matching process. 

In this manner, the shape, size and position of the search window is determined based on the estimation history 
of the previous motion vectors. Then, the block matching process is carried out between the current block and each 
35 of the reference blocks within the search window to detect a motion vector. 

Also, in the present invention, the size, shape and position of the search window may be adaptively determined. 
Therefore, the size, shape or position of the search window is changed. 

Fig 2 is a block diagram illustrating the structure of the block matching unit 3 according to the first embodiment 
of the present invention. Referring to Fig. 2, the block matching unit 3 is composed of a control circuit 10-1 and a block 
40 matching circuit 300-1 . The block matching unit 3 is formed on a semiconductor chip. However, only the block matching 
circuit 300-1 may be formed on the semiconductor chip. 

The control circuit 10-1 receives the search window designation data 161 from the motion vector monitoring unit 
4 to read the current block pixel data from the current picture data storage unit 1 and the pixel data for each of the 
reference blocks within the search window. 
45 The block matching circuit 300-1 is composed of a plurality of processing elements 21 arranged in M columns and 

N rows (M and N are positive integers, respectively, and M = 4 and N = 4 in this example). The processing elements 
21 are cascade-connected in units of rows and the rows are cascade-connected. The last processing element PE 4 4 
of the cascade connection is connected to a minimum value detecting circuit 25. The minimum value detecting circuit 
25 detects one of the reference blocks having the minimum value in the block matching process within the search 
50 window and outputs the minimum value reference block as a motion vector. 

It is not required that the number of processing elements is equal to the number of pixels of the current block. 
However, if the total number of processing elements PE MiN is made equal to the number of pixels of the current blocks, 
FIFO units which are required in the conventional system can be omitted. As a consequence, the chip area of the block 
matching unit 3 can be reduced, and the power consumption can be reduced. 
55 Also the block matching circuit 300-1 is composed of (P + Q) reference block pixel data buses (P or Q is a positive 

integer more than 1), a first group of M selectors, and a second group of N selectors. In this example, the reference 
block pixel data buses are the Rv1 , Rv2, Rh1 and Rh2 reference block pixel data buses 301 , 302, 303 and 304 in this 
example Sequences of pixel data for the reference blocks are supplied to the Rv1 , Rv2, Rh1 and Rh2 reference block 
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pixel data buses 301, 302, 303 and 304 from the reference picture data storage unit 2. Selectors 2211 to 2214 are 
provided as the first group of M selectors and are connected to the Rv1 and Rv2 reference block pixel data buses 301 
and 302. Also, selectors 2221 to 2224 are provided for the second group of N selectors and are connected to the Rh1 
and Rh2 reference block pixel data buses 303 and 304. The selectors 2211 to 2214 and 2221 to 2224 are controlled 
s in response to control signals from the control circuit 10-1 , respectively. 

The block matching circuit 300-1 is further composed of Rh reference data buses 206-1 to 206-4, Rv reference 
data buses 207-1 to 207-4 and current data buses 208-1 to 208-4. Each of the Rh reference data buses 206-1 to 206-4 
is provided for one of the selectors 2221 to 2224, and each of the Rv reference data buses 207-1 to 207-4 is provided 
for one of the selectors 2211 to 2214. Each of the reference data buses 206-1 to 206-4 and 207-1 to 207-4 transfers 
10 one reference block pixel data selected by the corresponding selector. Each of the current data buses 208-1 to 208-4 
are provided for one of the rows of processing elements 21 in this example, and the current data buses 208-1 to 208-4 
are connected to each other. Each of the plurality of processing elements is also connected to a corresponding one of 
the Rh reference data buses 206-1 to 206-4, a corresponding one of the Rv reference data buses 207-1 to 207-4, and 
a corresponding one of the current data buses 208-1 to 208-4. It should be noted that although the current data buses 
is 208-1 to 208-4 are commonly connected in the structural example shown in Fig. 2, the current data buses 208-1 to 
208-4 are not always commonly connected. 

The block matching circuit 300-1 is further composed of a circuit 11 . The circuit 11 switches the pixel data read 
from the reference picture data storage unit 2 between the reference block pixel data buses 301 to 304. The circuit 11 
may includes shift registers, each of which latches one row of reference block pixel data and outputs the pixel data of 
20 the row one by one. Thus, the read operation from the reference picture data storage unit 2 can be simplified. However, 
the reference block pixel data buses 301 to 304 may be directly connected to the reference picture data storage unit 
2. In this case, the circuit 11 can be omitted and the read operation is controlled by the control circuit 10-1. 

Fig. 3 is a block diagram illustrating the structure of each of the plurality of processing elements 21 and the pe- 
ripheral portion in the first embodiment. Referring to Fig. 3, the processing element (PE) 21 is connected to the output 
25 * port Do of a previous stage of processing element 21 via a signal line 203 and the input port Di of a next stage of 
processing element via a signal line 204. The processing element is connected to the corresponding current data bus 
208 at a terminal C, and to the corresponding Rh and Rv reference data buses 206 and 207 through a selector 23, 
which is not shown in Fig. 3. The selector 23 is controlled in response to a control signal from the control circuit 10-1 
One of reference pixel data on the corresponding Rh reference data bus 206 and reference pixel data on the corre- 
30 sponding Rv reference data bus 207 is selected by the selector 23. Then, the processing element 21 performs a 
calculation in response to a control signal from the control circuit 10-1 using the current pixel data received from the 
corresponding current data bus 208 and the selected reference pixel data. In the calculation, an absolute value of a 
difference between the current pixel data and the selected reference pixel data is calculated. In response to a control 
signal from the control circuit 10-1, the calculation result is added to the data received at the terminal Di from the 
35 previous stage of processing element. The adding result is outputted from the terminal Do for the next stage of process- 
ing element such that pipeline processing is allowed. The calculation and the addition are performed synchronously 
between the processing elements in response to the control signals or clock signals from the control circuit 1 0-1 . Thus, 
the block matching process can be performed in a pipeline manner. 

Fig. 4 is a block diagram illustrating the structure of the processing element as a minimum unit for constituting the 
40 block matching circuit 300 shown in Fig. 2. Referring to Fig. 4, the processing element 21 is composed of registers 
11a, 11b and 11c, a calculating element 12 and an adding element 13. The registers 11a, 11b and 11c are pipeline 
registers. The calculating element 12 calculates an absolute value (T = I IR - CI I) 205 of a difference between the 
pixel data CI of the current block which is supplied from the current data bus 208 via a signal line 202 and the pixel 
data IR of the reference block which is supplied from the selector 23 via a signal line 201 . The adding element 1 3 adds 
45 the data Di received at the Di port via the signal line 203 and the absolute value T 205, namely, (Di + T), and then 
outputs the summation from the Do port onto a signal line 204. In this manner, the pipeline processing is performed. 

Fig. 5 is a block diagram illustrating the motion vector monitoring unit 4. Referring to Fig. 5, the motion vector 
monitoring unit 4 is composed of a filter 1 201 , a search window determining circuit 1 202, and a window data storage 
unit 1203. The filter 1201 filters the motion vector estimated by the block matching unit 3. The filtered motion vector is 
50 stored in the motion vector storage unit 5. The search window determining circuit 1202 refers to the motion vector 
storage unit 5 to determine a search window from the estimation history of previously estimated motion vectors stored 
in the motion vector detecting unit 5. Also, the search window determining circuit 1202 stores a data designating the 
determined search window in the window data storage unit 1 203. It should be noted that the filter 1 201 may be replaced 
by another filter, or may be omitted. 
55 The above-explained search window designating data is composed of, for instance, an upper left position, height 

and width of each of rectangular reference regions of the determined search window, which will be described below. 

An example of the search window used in the present invention will be described below with reference to Figs. 6A 
and 6B. Fig. 6A is a diagram schematically illustrating the search window and Fig. 6B is a partial expanded diagram 
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of the search window shown in Fig. 6 A Referring to Fig. 6A, the search window 1100 is composed of a plurality of 
rectangular reference regions 1101, 1102, 1103, and 1104, which overlap one after another. That is, the reference 
regions 1 1 01 , 1 1 02, 1 1 03, and 1 1 04 are synthesized to form the search window 1 1 00. The search window 1 1 00 in the 
present invention is not necessarily required to have a rectangular shape, unlike the conventional system. In the block 

s matching unit 3 according to the first embodiment of the present invention, even when the search window is not rec- 
tangular but has a concave region and/or a convex region, this search window may be divided into the plurality of 
rectangular reference regions. 

On the other hand, a conventional search window 1 1 1 0 is illustrated by a dot line in Fig. 6 A and has a rectangular 
shape. The search window 1100 is different from that of the conventional example in regions 1111, 1112, 1113 and 

10 1114. These regions are determined by the search window determining circuit 1202 to be unnecessary to perform the 
block matching process. In the present invention, the block matching process is not performed to the regions 1111, 
1112, 1113 and 1114. Therefore, the block matching process can be completed at high speed. Even when the block 
matching process is carried out in this manner, since the operation speed of the processing elements 21 can be main- 
tained at 100%, this shape of the search window never give adverse influence to the process time and the power 

is consumption. 

Referring back to Fig. 1 , the motion vector monitoring unit 4 monitors and stores the motion vectors obtained from 
the block matching unit 3 in the motion vector storage unit 5. The motion vector monitoring unit 4 changes the shape, 
size, and/or position of the search window based on the motion vectors stored in the motion vector storage unit 5, i. 
e., the estimation history of the previously estimated motion vectors. Because the block matching unit 3 according to 

20 the first embodiment of the present invention is applied to the search window having any concave regions and convex 
regions, the efficiency of the block matching process is not decreased. As a consequence, since the estimation history 
of the previously estimated motion vector is more effectively reflected, there is no need to carry out the block matching 
process to the waste reference regions such as the regions 1111 , 1112, 1113 and 1114. Thus, the resultant processing 
time and the resultant power consumption can be reduced. 

25 Also, even if the size of the search window is changed based on the estimation history of the motion vectors, the 

number of reference regions is merely increased and/or decreased. There is no change in the rule of the sequences 
of pixel data supplied into the block matching unit 3, as described later. As a consequence, in accordance with this 
embodiment, the size, shape and/or position of the search window can be changed without making the block matching 
unit 3 complex. 

30 in the present invention, the size, shape, and/or the position of the search window is adaptively changed based 

on the previous motion vectors. As a result, the block matching process is not performed to the regions 1111 to 1114 
which are not originally required to perform the block matching process. Therefore, the processing time and amount 
can be reduced. As a consequence, the power consumption required for the block matching process can be reduced. 
Also, there is no detection of any erroneous motion vector by performing the block matching process operation to the 

35 improper regions. 

Next, the operation of the block matching process in the motion vector estimating apparatus according to the first 
embodiment of the present invention will be described below in detail. In this example, it is assumed that the current 
block is composed of 16 pixels of 4 x 4 (C 1t1 to C 44 ). The search window is shown in Fig. 6 A. The reference region 
1 1 01 of the search window is composed of 49 pixels of 7x7 (R lt1 to R 7 7 ) and the reference region 1 1 02 of the search 
40 window is composed of 105 pixels of 7 x 15. Also, it is assumed that the block matching circuit 300 includes (4 x 4) 
processing elements as shown in Fig. 2. 

Fig. 7 shows the relation of the previously estimated motion vectors, the search window, the reference block and 
the current block. 

In the start of the block matching process, the motion vector monitoring unit 4 refers to the motion vector storage 
45 unit 5 and determines the search window based on the motion vectors previously estimated. In this case, the motion 
vector monitoring unit 4 determines from the previously estimated motion vectors that there is no case where a new 
motion vector is found out in the regions 11 1 1 , 1 1 1 2, 1 1 1 3 and 1 1 14, and determines the search window, as shown in 
Fig. 6A Then, the reference regions 1101 to 1104 constituting the search window are determined and data designating 
each of the reference regions 1101 to 1104 is stored in the window data storage unit 1203. Also, the data designating 
50 the reference regions as the search window designation data is supplied to the control circuit 1 0-1 of the block matching 
unit 3. The control circuit 10-1 determines pixel data to be read out from the current picture data storage unit 1 and the 
reference picture data storage unit 2. The pixel data read out from the current picture data storage unit 1 and the 
reference picture data storage unit 2 are supplied to the block matching circuit 300-1 of the block matching unit 3. Thus, 
the block matching process is started. 
55 Figs. 8A and 8B are diagrams schematically illustrating the pipeline processing for the block matching process. In 

Figs. 8A and 8B, an abscissa denotes time. Symbols C 1fl to C 4 4 represent pixel data of the current block. Symbols 
Rt t to R7 7 represent pixel data of the reference region 1101, and symbols A to S 44 and R 5>1 to Ry 7 and so on 
represent pixel data of the reference region 1102 as shown in Fig. 6B. Furthermore, each of symbols D, , to D 44 
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represents data which indicates a likelihood of the corresponding reference block and which is supplied to the minimum 
value detecting circuit 25. The likelihood data is calculated based on the following equation (2): 



D(k,m) = y y\R(k + 1 - 1, m + ; - 1) - C(iJ)\ equation ( 2 ) 



In this formula, symbol V is selected to be 1 to 4, and symbol "m" is selected to be 1 to 4. 

10 As apparent from Figs. 8A and SB, four different sequences of pixel data 81 to 84 supplied to the processing 

elements are shown by symbols O, X, □ and A. As illustrated in Figs. 8A and 8B, the sequences of pixel data 81 to 84 
are sequentially supplied to the block matching circuit 300-1 . That is, at a time 1 , a pixel data C t t1 of the current block 
is supplied to the processing element PE 1(1 via the current data bus 208-1 . At the same time, a pixel data of the 
reference block of the reference region 1101 (Fig. 6B) is supplied to the processing element PE U from the sequence 

is of pixel data 81. In the processing element PE hV these pixel data are latched by the registers 11a and 11b and an 
absolute value of a difference between these pixel data, i.e., I - C ltl I is calculated by the calculating element 1 2. 
Because the processing element PE 1 t1 is a head element of the.cascade connection so that the data Di of "0" is supplied 
from the register 11c, the processing element PE 1(1 outputs the absolute value of the difference from the output port 
Do to the input port Di of the next stage of processing element PE 2 , at a time 2. 

20 At the time 2, a pixel data C 2 1 of the current block is supplied to the processing element PE 21 from via the current 

data bus 208-1 . The pixel data C 21 of the current block is supplied to the processing element PE 2 -,. At the same time, 
a pixel data of the reference block is supplied to the processing elements PE<, and PE 2 1 from the sequence of 
pixel data 81. 

In the processing element PE 1fl , since the pixel data C 1f1 has been already latched by the register 11a, the pixel 
25 data R 2 , is latched by the register 11b and then an absolute value of a difference between these pixel data Rg t1 and 
C 1 1t i.e., I R 2 t - C 1 ! I is calculated by the calculating element 12. The processing element PE 1t1 outputs the absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 2>1 at a 
time 3. Also, in the processing element PE 21 , the pixel data C 2t1 and R 21 are latched by the registers 11a and 11b 
and an absolute value of a difference between these pixel data, I R 2(1 - C 2 A I is calculated by the calculating element 
30 12. The adding element 13 adds the absolute value of the difference to the calculation result I R 1 j - C 1t1 I supplied 
from the processing element PE, -, via the input port Di. The processing element PE 2 ,i outputs the summation absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 3j1 at a 
time 3. 

After a time 4, the processing elements PE 1tlI PE 2 v PE 3j1 and PE 4-1 are in the following holding states. That is, 
35 in the processing element PE 41> a data of I H^ A -C^ l + IR 2>1 -C 2tl I + I R 3>1 -C 31 l + IR 4il -C 41 I is stored. Similarly, 
in the processing element PE 3 v a data of I R 21 - C t -, I + I R 31 - C 21 I + I R 41 - C 31 I is stored. In the processing 
element PE 21 ,adataof I R 31 - C 1A \ + \ R 41 - C 21 I is stored. In the processing element PE, v a data of I R 41 - C 1<n 
I is stored. 

At a time 5, a pixel data C n 2 of the current block is supplied to the processing element PE^ from via the current 
40 data bus 208. The pixel data o\ 2 of the current block is latched in the processing element PEj 2 . At the same time, a 
pixel data R 1 2 of the reference block is supplied to the processing elements PE, (1 and PE 1 2 from the sequence of 
pixel data 82. Also, a pixel data R 51 of the reference block is supplied to the processing elements PE^, PE 3 A and 
PE 4 J from the sequence of pixel data 81 . 

In the processing element PE 1(1 , since the pixel data C A A has been already latched by the register 11a, the pixel 
45 data R 1t2 is latched by the register lib and then an absolute value of a difference between these pixel data F^ , and 
C-, -i, i.e., I Ri 2 - j I is calculated by the calculating element 1 2. The processing element PE, t1 outputs the absolute 
value of the difference from the output port Do to the input port Di of the next stage of processing element PE 2 , at a 
time 6. 

Also, in the processing element PE 2 ?1 , since the pixel data C 2 A has been latched by the register 11a, the pixel 
so data R 51 is latched by the register 11b and an absolute value of a difference between these pixel data, I R^ - C 21 I 
is calculated by the calculating element 12. The adding element 13 adds the absolute value of the difference to the 
calculation result supplied from the processing element PE 1 11 via the input port Di. The processing element PE 2 A 
outputs the summation absolute value of the difference from the output port Do to the input port Di of the next stage 
of processing element PE 3 A at the time 6. 
ss Also, in the processing element PE, 2 , since the pixel data C t 2 and R n 2 are latched by the registers 1 1 a and 1 1 b 

and an absolute value of a difference between these pixel data, I 2 - ^ I is calculated by the calculating element 
1 2. The adding element 1 3 adds the absolute value of the difference to the calculation result supplied from the process- 
ing element PE 41 via the input port Di. The processing element PE^ outputs the summation absolute value of the 
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difference from the output port Do to the input port Di of the next stage of processing element PE 2i 2 at the time 6. 
The operation described above is repeated from the time 6 to a time 16. 

At a time 17. the likelihood D t A of the reference block having the pixel data of R t>1 to R4 4 is obtained. In this 
manner, the likelihoods D, , to D 4 4 for the reference blocks of the reference region 1101 can be sequentially obtained 
5 in a time interval from the time 1 7 to time 32 in a pipeline manner 

At the same time, at the time 17, a pixel data of the next reference region 1102 (Fig. 6B) is supplied to the 
processing element PE., ^ from the sequence of pixel data 83 with no waste rest. The pixel data S lfl is used together 
with the pixel data C 1 j to calculate an absolute value of a difference between the pixel data S 1 ^ and C 1t1 . In this 
manner, the block matching process to the next reference region 1 1 02 is started in a continuous state from the reference 
10 region 1 101 with no wasted processing time. 

In this manner, 16 pixel data of the current block in maximum are used at the same time. However, because the 
same current block pixel data are used in the same processing elements, the number of pixel data of the current block 
to be supplied at the time can be reduced to "1" if the pixel data already supplied is latched. In other words, in the block 
matching process shown in Figs. 8A and 8B, since the block matching process is carried out to the same current block, 
is the entry of the pixel data ^ to C 4 4 of the current block to the processing elements PE, ., to PE 4 4 is complete from 
the time 1 to the time 16. Also, the number of pixel data to be entered at the same time is selected to be 1 . 

On the other hand, the pixel data of the reference region are simultaneously supplied to 1 t 2, 3 or 4 processing 
elements. The sequence of pixel data 81 and the sequence of pixel data 82 are the pixel data present within the same 
reference region 1101 shown in Figs. 6A and 6B. Also, the sequence of pixel data 83 and the sequence of pixel data 
20 84 are the pixel data present within the same reference region 1 1 02. There is no problem even when these two reference 
regions 1101 and 1102 are overlapped with each other. 

According to the first embodiment of the present invention, the size, shape, and/or position of the search window 
is flexibly determined. Therefore, the bus wiring lines of the block matching circuit 300-1 are formed to be adaptive for 
the search window which is divided into a plurality of reference regions. As a consequence, even when the block 
25 matching process is performed in the parallel manner, this parallel degree is not decreased. The block matching process 
can be performed within a short time, and the power consumption can be reduced. Also, according to the first embod- 
iment of the present invention, even when the search window is made small, the motion vector can be detected in high 
precision. 

Next, the supply of the pixel data of the reference region will be described below. Figs. 9 A shows the cascade 
30 connection of the processing elements PE., (1 to PE 44 , as shown in Fig. 2. Figs. 9B to 9F show the manner in which 
the pixel data of the reference regions are supplied at the times 1 , 2, 5, 9 and 1 7 of Figs, 8A and 8B. 

Referring to Fig. 9B, the pixel data H A ^ of the sequence of pixel data 81 is supplied on, for example, the Rv1 
reference block pixel data bus 301 . At this time, the selector 2211 is controlled in response to a control signal from the 
control circuit 10-1 to select the pixel data on the Rv1 reference block pixel data bus 301 such that the selected pixel 
35 data is transferred onto the reference data bus 207-1 . Also, the selector 23 corresponding to the processing element 
PE., A is controlled in response to a control signal from the control circuit 10-1 to select the pixel data on not the Rh 
reference data bus 206-1 but the Rv reference data bus 207-1 . Thus, the pixel data F^ (1 can be supplied to the process- 
ing element PE 1 v 

Next, as shown in Fig. 9C, at the time 2, the pixel data Ffej is supplied in the same manner as the pixel data R 1>v 
40 The selectors 23 corresponding to the processing elements PE-, t1 and PE 2 11 are controlled in response to control 
signals from the control circuit 10-1 to select the pixel data on not the Rh reference data bus 206-1 but the Rv reference 
data buses 207-1 and 207-2. Thus, the pixel data Rg can be supplied to the processing elements PE t ., and PE 21 . 

Next, as shown in Fig. 9D, at the time 5, the pixel data R 5 11 of the sequence of the pixel data 81 is supplied on the 
Rv reference block pixel data bus 301 in the same manner as the pixel data R, v Also, the pixel data R, 2 of the 
45 sequence of the pixel data 82 is supplied on, for example, the Rv reference block pixel data bus 302. At this time, the 
selector 2211 is controlled in response to a control signal from the control circuit 10-1 to select the pixel data F^ 2 on 
the Rv reference block pixel data bus 301 such that the selected pixel data is transferred onto the reference data bus 
207-1 . The selectors 23 corresponding to the processing elements PE 2 t0 PE 4,i are controlled in response to control 
signals from the control circuit 10-1 to select the pixel data R5 ., on not the Rh reference data bus 206-1 but the Rv 
50 reference data buses 207-2 to 207-4. Thus, the pixel data R 5 j can be supplied to the processing elements PE 2 (1 to 
PE 41 . Also, the selectors 23 corresponding to the processing elements PE 1 ^ to PE^ 2 are controlled in response to 
control signals from the control circuit 10-1 to select the pixel data R, 2 on not the Rh reference data buses 206-1 and 
206-2 but the Rv reference data bus 207-1 . Thus, the pixel data R-, 2 can be supplied to the processing elements PE 1 ^ 
and PE^ 2 . 

55 Next, as shown in Fig. 9E, at the time 9, the selectors 2211, 2221 and 2222 are controlled in response to control 

signals from the control circuits 10-1 such that the pixel data F^ 3 of the sequence of the pixel data 81 is supplied to 
the Rv reference data bus 207-1 and the pixel data R$ 2 of the sequence of the pixel data 82 is supplied to the Rh 
reference data buses 206-1 and 206-2. Then, the selectors 23 corresponding to the PE AA to PE^ 3 are controlled by 
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the control circuit 10-1 such that the pixel data R 1 3 is supplied to the processing elements PE 1#1 , PE 1 2 and PE A 3 and 
the pixel data R 5 2 is supplied to the processing elements PE 2 to PE 4 1 and PE 2f2 to PE 4 ^. 

Next, as shown in Fig. 9F, at the time 17, the sequence of pixel data 81, the sequence of pixel data 83 and the 
sequence of pixel data 82 are supplied to the Rh1 reference block pixel data bus 303, the Rh2 reference block pixel 
s data bus 304, and the Rv1 reference block pixel data bus 301 , respectively. The selectors 221 1 to 221 4 are controlled 
in response to control signals from the control circuit 1 0-1 such that the pixel data 4 is transferred to the Rv reference 
data buses 207-1 to 207-4. The selectors 2221 to 2224 are controlled in response to control signals from the control 
circuit 10-1 such that the pixel data S 1 1 is transferred to the reference data bus 206-1 and the pixel data Rj 5 is 
transferred to the reference data buses 206-2 to 206-4. Also, the selectors 23 corresponding to the processing elements 

10 PE 2 A to PE 4 1( PE 2 2 to PE 4 £, PE 2 3 to PE 4 3 and PE 24 to PE 4 4 are controlled by the control circuit 10-1 such that the 
pixel data R 5 4 is supplied to the processing elements PE 2 A to PE 4 v PE 2 2 to PE 42 , PE 2 3 to PE 4 3 and PE 2 4 to PE 4 4 . 
The selectors 23 corresponding to the processing element PE 1 ^ is controlled by the control circuit 10-1 such that the 
pixel data S 1(1 is supplied to the processing element PE^ A and the selectors 23 corresponding to the processing 
elements PE 1 2 to PE 1 4 are controlled by the control circuit 10-1 such that the pixel data Rj 5 is supplied to the process- 
's ing elements PE, 2 to PE, 4 . 

Fig. 10 is a diagram illustrating a method for supplying the pixel data to the processing elements when four kinds 
of pixel data are supplied to the processing elements at a time. As shown in Fig. 10, the processing elements are 
divided into four regions 1121, 1122, 1123 and 1124 by boundary lines 1131, 1132 and 1133. 

The example shown in Fig. 1 0 corresponds to the state of the time 22 in Figs. 8A and 8B. In this case, the supply 

20 of the pixel data from 4 reference block pixel data buses 301 to 304 to 4 regions 11 21 to 1 1 24 in Fig. 1 0 are controlled 
by moving the boundary lines 11 31 to 1133. That is, the selectors 2211 and 2212 are controlled by the control circuit 
10-1 such that the sequence of pixel data 84 can be supplied to the Rv reference data buses 207-1 and 207-2. The 
selectors 2213 and 2214 are controlled by the control circuit 10-1 such that the sequence of pixel data 81 can be 
supplied to the Rv reference data buses 207-3 and 207-4. The selectors 2221 and 2222 are controlled by the control 

2$ circuit 10-1 such that the sequence of pixel data 83 can be supplied to the Rh reference data buses 206-1 and 206-2. 
The selectors 2223 and 2224 are controlled by the control circuit 10-1 such that the sequence of pixel data 82 can be 
supplied to the Rh reference data buses 206-3 and 206-4. If the selectors 23 corresponding to the processing elements 
PE 1 } to PE 44 are controlled, the four kinds of sequences of pixel data can be supplied to the processing elements 
PE^ to PE 4 4 as shown in Fig. 1 0. That is, in the block matching unit 3 according to the first embodiment of the present 

30 invention, the four kinds of pixel data can be supplied to the processing elements at the same time by properly moving 
boundary lines 1131 to 1133. 

In this manner, since 4 kinds of pixel data over the reference regions in maximum and also one kind of pixel data 
of the current block can be supplied, the operation rate of the processing elements can be increased to 100%. 

Also, even when the block matching process is carried out to a plurality of reference regions, the block matching 

35 process can be successively executed while changing the reference regions, in the state in which the operation rate 
of the processing elements are maintained at 100%. As a result, in this embodiment, the time period required for 
executing the block matching process can be shortened. When it is supposed that the time period for executing the 
block matching process is a predetermined constant value, the operation frequency of a clock signal for the block 
matching unit 3 can be reduced, as compared with that of the conventional system. As a consequence, the power 

40 consumption of the block matching circuit can be reduced. 

Also, since the time period for performing the necessary block matching process can be shortened, the block 
matching process can be carried out in the wider reference region. 

According to the present invention, the pixel data of the reference region is supplied via a hierarchical selector 
structure, i.e., the two stages of selectors to the bus wiring line connected to the processing elements as the minimum 

45 units of the block matching circuit 300-1 . As a consequence, even when the block matching process is carried out to 
the plurality of reference regions 1101 to 1104, it is possible to execute the block matching process in such a manner 
that the pipelines of the block matching circuit 300-1 are continuously filled without a wasted processing time. That is, 
the parallel degree of the block matching process is not decreased. As a consequence, even when the search window 
is divided into the plurality of reference regions, the processing time of the block matching process is not increased 

50 and the power consumption is not increased. 

Also, the local wiring lines 203 and 204 in Fig. 4 are employed in the wiring line system used to connect the 
processing elements as the minimum units of the block matching circuit 300-1. Therefore, when the minimum value 
or likelihood is calculated from the summation of the difference absolute values, the total number of the wiring lines 
from all the processing elements to the minimum value detecting circuit 25 can be suppressed. As a consequence, 

55 the motion vector estimating apparatus can be realized with the smaller chip area and with power consumption reduced. 

Next, the motion vector estimating apparatus according to the second embodiment of the present invention will 
be described below. Fig. 11 shows the structure of the block matching unit 3 of the motion vector estimating apparatus 
in the second embodiment. Referring to Fig. 11 , the block matching unit 3 is composed of a control circuit 10-2 like the 
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control circuit 10-1 in the first embodiment, and a block matching circuit 300-1. 

The block matching circuit 300-2 includes processing elements PE U to PE M N of M columns and N rows (M and 
N are positive even integers, respectively), each of which is the same as the processing element in the first embodiment. 
In this embodiment, M is 4 and N is 8. The current block is composed of (M x N) pixels, i.e., 4 x B in this example. The 

5 processing elements are cascade-connected via the output port and the input port in units of rows. The first row is 
connected to the third row via a FIFO unit 24-1 in cascade connection manner. A minimum value detecting circuit 28-1 
is connected to the last stage of processing element PE 4 3 via a FIFO unit 24-2. The second row is connected to the 
fourth row via a FIFO unit 25-1 in cascade connection manner. A minimum value detecting circuit 28-2 is connected 
to the last stage of processing element PE 4>4 via a FIFO unit 25-2. Also, the fifth row is connected to the seventh row 

10 via a FIFO unit 26-1 in cascade connection manner. A minimum value detecting circuit 28-3 is directly connected to 
the last stage of processing element PE 4 7 . The sixth row is connected to the eighth row via a Fl FO unit 27-1 in cascade 
connection manner. A minimum value detecting circuit 28-4 is directly connected to the last stage of processing element 

In other words, as shown in Fig. 11 , in the processing elements of the odd rows in an upper half of block matching 
is circuit 300-2, the input and output ports 203 and 204 are cascade-connected via the FIFO units 24-1 and 24-2. In the 
processing elements of the even rows in the upper half of the block matching circuit 300-2, the input and output ports 
203 and 204 are cascade-connected via the FIFO units 25-1 and 25-2. In the processing elements of the odd rows in 
the lower half of the block matching circuit 300-2, the input and output ports 203 and 204 are cascade connected via 
the FIFO unit 26-1 . In the processing elements of the even rows in the lower half of the block matching circuit 300-2, 
20 the input and output ports 203 and 204 are cascade-connected via the FIFO unit 27-1 . 

The stage numbers of the FIFO units 24-1 , 25-1 , 24-2, 25-2, 26-1 and 27-1 are 4, 4, 20, 1 6, 4 and 4, respectively, 
as shown in Fig. 11. 

Rh reference data buses 206-1 to 206-8 are provided for the rows, respectively, and Flv reference data buses 
207-1 to 207-4 are provided for the columns, respectively. The relation of one of the processing elements and the 

25 corresponding Rh and Rv reference data buses is the same as in the first embodiment. The RV reference block pixel 
data buses 301 and 302 are provided and one of them is connected to the Rv reference data buses 207-1 to 207-4 
via selectors 2211 to 2214, respectively. The Rh reference block pixel data buses 303 and 304 are provided and one 
of them is connected to the Rh reference data buses 206-1 to 206-8 via selectors 2221 to 2228, respectively.- 

The current data buses 208-1 to 208-8 are provided for the rows, respectively, and are connected together. It 

30 should be noted that although the current data buses 208-1 to 208-8 are commonly connected in the structural example 
shown in Fig. 11 , these current data buses 208-1 to 208-8 are not always commonly connected. 

One of pixel data of four sequences on the Rv and Rh reference block pixel data buses 301 to 304 is supplied to 
one processing element via one of the selectors 2211 to 2214 and one of the selectors 2221 to 2228, and then the 
selector 23 corresponding to the processing element, as in the first embodiment. 

35 In Fig. 1 1 , if the total number of processing elements is set equal to the number of pixels, M x N, of the current 

block, the stage number in each of the FIFO units can be decreased. As a result, the chip area of the block matching 
circuit 300-2 can be further reduced, and also the power consumption can be reduced. 

Also, in Fig. 1 1 , it is assumed that the number of stages of each of the FIFO units 24-1 , 25-1 , 26-1 , 27-1 is selected 
to be M, that the number of stages of the FIFO unit 24-2 is selected to be (M x N)/2 + M, and that the number of stage 

40 of the FIFO unit 25-2 is selected to be (M x N)/2. In this case, a summation of an output 305 from the FIFO unit 24-2, 
an output 306 from the FIFO unit 25-2, an output 307 from the processing element PE M>N . 1t and an output 308 from 
the processing element PE M N can be equal to the output 310 shown in Fig. 2. Further, according to the second em- 
bodiment of the present invention, the data entry from the reference regions is scheduled in such a manner that the 
number of stage of each of the FIFO units can be simultaneously reduced. As a consequence, the block matching 

45 circuit can be realized such that the chip area can be saved and the power consumption can be reduced. 

In the case that the present invention is applied to the MPEG-2 (ISO/lEC 13818-2), if the circuit structure shown 
in Fig. 11 is employed, there can be obtained at the same time the data which are required in a frame motion compen- 
sating prediction of the motion compensating prediction in a frame structure, a field motion compensating prediction, 
a field motion compensating prediction of the motion compensating perdition in the field structure, and the 16 x 8 motion 

50 compensating prediction. In addition, the processing elements are divided into four kinds of processing elements as 
the respective processing elements belonging to the upper odd rows, the upper even rows, the lower odd rows, and 
the lower even rows. Therefore, the motion vectors can be simultaneously acquired in response to a plurality of pre- 
diction modes of MPEG-2 (ISO/lEC 13818-2). In this case, there are only the local wiring lines among the wiring lines 
connected from the processing elements, and the chip area can be reduced. 

55 Next, the motion vector estimating apparatus according to the third embodiment of the present invention will be 

described below. Fig. 12 shows the structure of a block matching circuit 300-3 in the third embodiment. The block 
matching circuit 300-3 includes processing elements of 8 columns and 16 rows. The structural concept of the block 
matching circuit shown in Fig. 12 is the same as that of the block matching circuit in the second embodiment. The 
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difference point is in the number of processing elements. 

Like in the second embodiment, in the processing elements of the odd rows in an upper half of block matching 
circuit 300-3, the input and output ports 203 and 204 are cascade-connected via three FIFO units. In the processing 
elements of the even rows in the upper half of the block matching circuit 300-3, the input and output ports 203 and 204 
5 are cascade-connected via three FIFO units. In the processing elements of the odd rows in the lower half of the block 
matching circuit 300-3, the input and output ports 203 and 204 are cascade<onnected via three FIFO units. In the 
processing elements of the even rows in the lower half of the block matching circuit 300-3, the input and output ports 
203 and 204 are cascade-connected via three FIFO units. Each of the above-mentioned FIFO units has the stages of 8. 

TTie last stage of processing element of the odd rows in the upper half is connected to adders 31 and 32 via a 
10 Fl FO unit having the stages of 72. The last stage of processing element of the even rows in the upper half is connected 
to the adder 31 and an adder 33 via a FIFO unit having the stages of 64. The last stage of processing element of the 
odd rows in the lower half is directly connected to the adder 32 and an adder 34. The last stage of processing element 
of the even rows in the lower half is directly connected to the adders 33 and 34. The outputs of the adders 32 and 33 
are connected to an adder 35. 

is in this embodiment, in spite of decreasing the number of stages in each FIFO unit, three matching criteria needed 

for MPEG2, i.e., 16 x 16, 16 x 8 (top) and 16x8 (bottom) for the frame structure can be obtained at the same time. 
Also, three matching criteria, i.e., 16 x 16, 16 x 8 (upper) and 16x8 (tower) for the field structure can be obtained at 
the same time. 

Fig. 13 is a diagram to explain the supply of the reference block pixel data. The reference block pixel data are 

20 supplied from four reference block pixel data buses while changing the boundary lines, as in the first embodiment. 
Therefore, the detail description is omitted. 

• Next, the motion vector estimating apparatus according to the fourth embodiment of the present invention will be 
described below. The basic structure of the motion vector estimating apparatus in the fourth embodiment is the same 
as that of the first embodiment. Therefore, the different point will be described. 

2S Fig. 1 4 shows the structure of the motion vector monitoring unit 4' of the motion vector estimating apparatus in the 

fourth embodiment. Referring now to Fig. 14, the motion vector monitoring unit 4' is composed of the filter 1201 for 
filtering the motion vectors obtained from the block matching unit 3, the search window determining circuit 1202, the 
window data storage unit 1 203, and a bit map storage unit 1 204. The filter 1 201 and the window data storage unit 1 203 
are the same as those in the first embodiment. 

30 The search window determining circuit 1 202 determines estimation history of previously estimated motion vectors 

obtained from the motion vector storage unit 5, determine a search window from the estimation history and stores 
pointers designating this determined search window into the window data storage unit 1 203. Also, the search window 
determining circuit 1202 determines bit-map regions 1501 and 1502 as shown in Fig. 15, and then stores this bit-map 
regions 1501 and 1502 in the bit map storage unit 1204. It should be noted that the fitter 1201 is provided, if necessary. 

35 Each of the above -described pointers are composed of, for instance, an upper left position of each of the rectangular 
reference regions of the search window, a height of each reference region, a width of each reference region, and so on. 

With reference to Fig. 6A, the reference regions 1101 to 1104 which form the search window 1100 can be repre- 
sented to have a rectangular shapes. To the contrary, each of the bit-map regions 1501 and 1502 can be defined in 
units of pixels by free curved lines, as shown by white circles in Fig. 15. In this case, a 1 -bit flag data can be added to 

40 each of the pixel data of the reference regions 1101 to 1104 within the search window to be supplied into the block 
matching unit 3. The 1-bit flag data indicates whether the pixel data is valid or invalid. 

Fig. 1 6 is a block diagram illustrating the structure of the processing element. The processing element of the block 
matching circuit in the fourth embodiment has a register 11 d for fixedly holding a maximum value, and a selector 14 
for selecting one of the output from the register 11 d and the output from the adding element 13 in response to a control 

45 signal from the control circuit 10-1. 

When a pixel data is supplied to the block matching unit 3, the control circuit 10-1 checks the flag data of the pixel 
data. When the flag data indicates that the pixel data is contained in the bit map region 1501 or 1502 shown in Fig. 15 
and is invalid, the control circuit 10-1 outputs the control signal to the processing element for the invalid pixel data to 
be supplied such that the content of the register 11 d is outputted from the output port Do via the selector 14. As a 

so result, the reference block containing the invalid pixel data of the bit map regions is excluded from a region where any 
motion vector is to be detected. In this case, the pixel data may be not read out from the reference picture data storage 
unit 2. Therefore, the calculations of the absolute value of difference and addition may be not performed in the process- 
ing element. Thus, the power consumption can be reduced. 

There are cases that the reference regions are limited in the vicinity of the right end, the left end, the upper end, 

ss and the lower end. Even when the limited reference regions are present, the flag data is added to the pixel data in the 
reference regions of the search window regions, and the block matching process is selectively performed based on 
the flag data of each pixel data. Therefore, in this case, the structure of the block matching circuit 3 is not made complex. 
As described above, even when the data width of the reference region is increased by 1-bit at most, there is no need 
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to change the structure of the block matching unit 3. 

In a motion vector estimating apparatus according to the fourth embodiment of the present invention, the block 
matching process is carried out between a certain current block and a reference block within the search window to 
detect a motion vector. The block matching circuit in the fourth embodiment is structured in a similar manner to Fig. 2, 
5 11 or 12. Also, each of the pixel data having the valid flag is supplied in a similar manner to Figs. 8A and 8B. 

In the embodiment of the present invention, as described above, since the search window can be freely determined, 
the useless calculations performed in the processing elements, and the useless data reading operation from the ref- 
erence picture data storage unit 2 are decreased. As a result, the power consumption can be lowered. 

Fig. 17 is a block diagram illustrating the structure of the motion vector estimating apparatus according to the fifth 
10 embodiment of the present invention. Referring now to Fig. 17, the motion vector estimating apparatus in the fifth 
embodiment is composed of a temperature detecting unit 6, a load detecting circuit 7, and a power supply voltage 
detecting circuit 8, in addition to the structure shown in Fig. 1. In this embodiment, the motion vector monitoring unit 
4' has the bit map storage unit 1204, like the fourth embodiment. 

As described in the first embodiment, the block matching circuit is realized on a semiconductor chip. The temper- 
15 ature detecting circuit 6 detects a temperature of the chip on which the block matching unit 3 is formed. Then, the 
temperature detecting circuit 6 generates a search window restriction data based on the detecting result to supply to 
the motion vector monitoring unit 4V 

The load detecting circuit 7 detects a system load of the block matching unit 3 such as the number of times of the 
block matching process or an amount of data to be supplied to the block matching unit 3. Then, the load detecting 
20 circuit 7 generates a search window restriction data based on the detecting result to supply to the motion vector mon- 
itoring unit 4'. 

The power supply voltage detecting circuit 8 detects the power supply voltage of the motion vector estimating 
apparatus and generates a search window restriction data based on the detecting result to supply to the motion vector 
monitoring unit 4'. 

25 in this manner, the motion vector monitoring unit 4' determines the search window based on not only the motion 

vector estimation history but also at least one of the temperature, the system load and the power supply voltage. For 
example, when the detected temperature is higher than a reference temperature, when the detected system load is 
heavier than a reference load, or when the power supply voltage is lower than a reference voltage, the search window 
may be determined to have a smaller size in addition to change of the shape and position. 

30 in this manner, the shape, size and position of the search window can be adaptively determined based on the 

estimation history of the previous motion vectors and the detected temperature, load or voltage. Then, the block match- 
ing process is carried out between the current block and each of the reference blocks within the search window to 
detect a motion vector. 

Also, in the present invention, the size, shape and position of the search window may be adaptively determined 
35 based on system load such as the load of the block matching unit 3, the chip temperature of the block matching unit 
3, or the power supply voltage. In this case, when the system toad is heavy, this implies that the process required for 
the entire system as the motion vector process is large. Therefore, the size, shape or position of the search window 
is changed. 

Since the size, shape, or position of the search window is adaptively determined based on the above<lescribed 

40 system load, no excessive system load is given to the processing capability of the motion vector estimating apparatus. 
In other words, for instance, when the chip temperature is increased, the search window is made narrower. In this 
case, since the search window becomes narrower, the calculation amount by the block matching unit 3 is restricted 
and heat dissipation of the LSI chip is restricted. Similarly, when the power supply voltage from a power source such 
as a cell used to drive the motion vector estimating apparatus is decreased, the search window may be conceivably 

45 narrowed in accordance with the detected power supply voltage. Thus, in a portable terminal equipment by using a 
cell, the operation time thereof can be extended. 

In the present invention, a motion vector of a frame previous to the current frame by 1 , a motion vector of a field 
previous to the current field by 1, a motion vector of the same frame as that of the current reference region, and a 
motion vector of the same filed as that of the current reference region may be employed as the estimation history of 

so the previously estimated motion vectors. 

Also, while utilizing features of pixel data of a reference region, for example, image data related to sports and 
image data related to movies may be referred to for determining the estimation history of motion vectors. 

In the conventional motion vector estimating method, while the same position as the current block within either 
the frame or the field is used as the center, the reference region is determined. 

ss On the other hand, in the embodiments of the present invention, the position of the search window may be adap- 

tively varied based on the above-described estimation history of motion vectors. That is to say, referring to the estimation 
history of the motion vectors, when the average value of this motion vectors is directed to a predetermined direction, 
the center position of the search window can be moved to this direction. In this case, although the search window is 
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made narrower, it is possible to detect the correct motion vector. 

Also, similar to the estimation history of motion vectors, either the shape or size of the search window may be 
conceivably changed with reference to the load of the motion vector estimating apparatus, the power suppty voltage 
and the chip temperature. In other words, for instance, while monitoring a temperature of an LSI chip on which the 
5 motion vector estimating apparatus is mounted, when this temperature is increased, the reference region is made 
narrower. In this case, since the search window becomes narrower, the calculation amount by the LSI may suppress 
heat dissipation of the LSI. 

Similarly, while monitoring a remaining power amount of a power source such as a cell used to drive the motion 
vector estimating apparatus, the search window may be conceivably narrowed in accordance with this remaining power 

10 amount. In this case, for instance, in a portable terminal by using the cell, the operation time thereof can be extended. 

Furthermore, in the embodiment of the present invention, the search window determining circuit 1202 may be 
realized by a microprocessor or the like. As a consequence, the search window determining circuit 1 202 may be mount- 
ed on another LSI. This is because the changing operation of the shape, size and/or position of the search window 
may be once carried out within 1 frame, 1 field, or a certain time period. Therefore, the process time required for 

is determining the search window does not impede the overall operation. In this case, since the algorithm for determining 
the search window can be programmed, the search window can be more flexibly determined. For example, in the case 
that certain image data is changed from the sports image into the movie image, this image change is determined by 
a human operator, so that the search window may be changed. 

As described above, according to the present invention, the below-mentioned effects can be achieved. 

20 First, in the present invention, the motion vector can be detected in high precision under low power consumption. 

This is because the dimensbn, shape and/or position of the search window can be adaptively determined based on 
previously estimated motion vectors. Also, the dimension, shape and/or position of the search window can be adaptively 
determined based on the load of the motion vector estimating apparatus such as temperature of the motion vector 
estimating apparatus, the power supply voltage and the number of time of block matching process. 

25 Second, in the present invention, when the block matching process is performed in the parallel manner, this parallel 

degree is not decreased. Asa result, according to the present invention, the block matching process can be performed 
within a short time, and the power consumption can be reduced. Also, according to the present invention, even when 
the search window is made small, the motion vector can be detected in high precision. This is because the search 
window can be flexibly determined and divided into a plurality of reference regions. Also, it is because the data supply 

30 schedule is devised. 

Third, in the present invention, the chip area for the block matching circuit can be saved and the power consumption 
can be reduced. This is because the number of buses required to enter the reference block pixel data is mall. Also, it 
is because the entry of the reference block pixel data is scheduled in such a manner that the number of stage in the 
FIFO units can be reduced. 

35 Fourth, in the present invention, even when the block matching process is carried out over a plurality of reference 

regions, it is possible to execute the block matching process such that the pipeline of the block matching circuit is 
always filled. That is, the parallel degree of the process is not decreased. As a consequence, in accordance with the 
present invention, even when the reference region is subdivided into the plural reference regions, there is no risk that 
the process is increased and the power consumption is increased. 

40 This is because the reference block pixel data of the search window is supplied via the two stage of selectors to 

the bus wiring line connected to the processing elements. 

Fifth, in the present invention, when the minimum value or likelihood is calculated from the summation of the 
difference absolute values, increasing in the total number of the wiring lines from all the processing elements to the 
minimum value detecting circuit can be suppressed. As a consequence, according to the present invention, the motion 

45 vector estimating apparatus can be realized with the smaller chip area and with reduced power consumption. This is 
because the local wiring lines are employed in the wiring lines used to connect the processing elements. 

Sixth, in the present invention, the motion vectors can be simultaneously acquired in response to a plurality of 
prediction modes of MPEG-2 (ISO/IEC 13818-2). In this case, according to the present invention, there are only the 
local wiring lines among the wiring lines connected from the processing elements, and the chip area can be reduced. 

so This is because the processing elements are divided into four sorts of processing elements as the respective processing 
elements belonging to the even row, the odd row, the upper half row, and the lower half row. 



Claims 

55 

1. A motion vector estimating apparatus comprising: 

a current picture storage unit (1) which stores image data of a current picture; 
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a reference picture storage unit (2) which stores image data of a reference picture; 

a search window determining unit (4) which determines estimation history from previousty estimated motion 
vectors, and determines a search window based on said estimation history, wherein said search window is 
composed of rectangular reference regions; 
5 a block matching circuit (300-1 , 300-2, 300-3) for performing a block matching process to a current block and 

each of reference blocks of said search window to determine a motion vector; and 

a control circuit (10-1, 10-2) which reads out the image data of said current block from said current picture 
storage unit to supply said block matching circuit, which reads out the image data of each of said reference 
blocks of said search window from said reference picture storage unit to supply to said block matching circuit, 
io and which controls said block matching circuit such that said block matching process is performed to said 

current block and said each reference block of said search window. 

2. A motion vector estimating apparatus according to claim 1 , wherein at least one of a shape of said search window, 
a size of said search window and a position of said search window is determined based on said estimation history 

is 

3. A motion vector estimating apparatus according to claim 2, wherein said search window determining unit deter- 
mines whether a number of previously estimated motion vectors exceeds a predetermined number, and neglects 
the previously estimated motion vectors to determine said size and shape of said search window, when it is de- 
termined that the number of previously estimated motion vectors is less than the predetermined number. 

20 

4. A motion vector estimating apparatus according to claim 2 or 3, wherein said search window determining unit 
determines whether a number of previously estimated motion vectors exceeds a predetermined number, produces 
a 2-dimensional map in which each of said previously estimated motion vectors is mapped based on its horizontal 
value and the vertical value, when it is determined that the number of previously estimated motion vectors exceeds 

25 the predetermined number, and determines said size and shape of the search window based on said 2-dimensional 

map. 

5. A motion vector estimating apparatus according to any one of claim 2 through 4, wherein said search window 
determining unit calculates average vector of said previously estimated motion vectors and determines said posi- 

30 tion of said search window using said average vector. 

6. A motion vector estimating apparatus according to any one of claim 1 through 5, further comprising a load detecting 
circuit (7) for detecting either one of a load of said motion vector estimating apparatus or a load of said block 
matching circuit, and 

35 wherein said search window determining unit determines said search window based on said estimation his- 

tory and the detected load. 

7. A motion vector estimating apparatus according to any one of claims 1 through 6, further comprising a voltage 
detecting circuit (8) for detecting a power supply voltage, and 

40 wherein said search window determining unit determines said search window based on said estimation his- 

tory and the detected power supply voltage. 

8. A motion vector estimating apparatus according to any one of claims 1 through 7, further comprising a temperature 
detecting circuit (6) for detecting a temperature of a chip on which said block matching circuit is installed, and 

45 wherein said search window determining unit determines said search window based on said estimation his- 

tory and the detected temperature. 

9. A motion vector estimating apparatus according to any one of claims 1 through 8, wherein said search window 
determining unit determines at least an unuseful region in said search window in unit of pixels based on said 

50 estimation history, and set a flag to each of pixels of said unuseful region such that said block matching process 

to said reference block which includes any pixel of said unuseful region is made invalid based on the flag of the 
pixel of said unuseful region. 

1 0. A motion vector estimating apparatus according to claim 9, wherein said block matching circuit includes processing 
55 elements (21) of M columns and N rows (M and N are positive integers) provided such that a likelihood between 

said current block and each of said reference blocks is calculated through pipeline processing. 

11. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit further includes 
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a hierarchical selector structure (2211 to 2214, 2221 to 2224, 23) such that each of said processing elements is 
supplied with one of said pixel data of each of said reference blocks. 

12. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit further includes 
s (M + N) reference data buses 206-1 to 206-4, 207-1 to 207-4), which are provided for said M columns and said N 

rows, and N current data buses provided for said N rows, respectively. 

13. A motion vector estimating apparatus according to claim 10, wherein said block matching circuit comprises: 

10 said processing elements (21 ) of M columns and N rows (M and N are positive integers) , wherein said process- 

ing elements are cascade-connected in each of said N rows, wherein each of said processing elements cal- 
culates an absolute value of a difference between a corresponding pixel data of a current block and one pixel 
data of each of reference blocks, and adds the absolute value to an output from a previous stage of processing 
element to output the adding result to a next stage of processing element, and wherein said N rows are cas- 

is cade-connected to allow pipeline processing; 

M reference data column buses (207-1 to 207-4) provided for said M columns, respectively; 

N reference data row buses (206-1 to 206-4) provided for said N rows, respectively; 

(M x N) selectors (22), which are respectively provided for said processing elements to allow one of the pixel 

data on the corresponding one of said M reference data column buses and the pixel data on the corresponding 

20 one of said N reference data row buses to said corresponding processing element as the reference block pixel 

data; and 

a cu rrent block pixel data bus (208-1 to 208-4) for transferring current block pixel data to each of said processing 
elements. 

25 14. A motion vector estimating apparatus according to claim 1 3, further comprising: 

a reference block data bus group (301 to 304) for transferring pixel data of said reference blocks in units of 
rows as sequences of pixel data; 

a first group of selectors (2211 to 2214) which are respectively provided for said M reference data column 
30 buses, and each of which selects one pixel data of one of said sequences to supply to the corresponding 

reference data column bus; and 

a second group of selectors (2221 to 2224) which are respectively provided for said N reference data row 
buses, and each of which selects one pixel data of one of said sequences to supply to the corresponding 
reference data row bus. 

35 

15. A motion vector estimating apparatus according to claim 13 or 14, wherein each of said processing elements 
latches the corresponding current block pixel data once the corresponding current block pixel data is supplied via 
said current block pixel data bus. 

40 1 6. A motion vector estimating apparatus, according to any one of claims 1 3 through 1 5, wherein each of said process- 
ing elements is synchronized with a clock signal to perform the calculation of the absolute value of the difference 
and the addition and to output the addition result to the next stage of processing element. 

17. A motion vector estimating apparatus according to claim 16, wherein said processing elements calculates a like- 
45 lihood of one of said reference blocks in a time period from a first time to (M x N) time, and calculates a likelihood 

of a different one of said reference blocks in a time period from a second time to (M x N + 1) time. 

1 8. A motion vector estimating apparatus according to any one of claims 1 3 through 1 7, wherein said current block is 
composed of pixels of M columns and N rows. 

so 

19. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
elements are cascade-connected in each of said N rows and said N rows are cascade-connected. 

20. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
55 elements are cascade-connected in each of said N rows (N is an even number), wherein odd-numbered ones of 

said N rows in an upper half are cascade-connected via first FIFO units, each of which is provided for two adjacent 
odd-numbered rows, wherein even-numbered ones of said N rows in said upper half are cascade-connected via 
second FIFO units, each of which is provided for two adjacent even-numbered rows, wherein odd-numbered ones 
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of said N rows in a tower half are cascade-connected via third FIFO units, each of which is provided for two adjacent 
odd-numbered rows, and wherein even-numbered ones of said N rows in said lower half are cascade-connected 
via fourth FIFO units, each of which is provided for two adjacent odd-numbered rows, and 

wherein said block matching circuit further includes fifth and sixth FIFO units connected to last stages of 
5 processing elements in said cascade connections for the odd-numbered rows and the even-numbered rows in 

said upper half, respectively. 

21. A motion vector estimating apparatus according to claim 20, wherein a number of stages in each of said first to 
fourth FIFO units is M, and a number of stages in said fifth FIFO unit is (M x N/2 + M), and a number of stages in 

10 said sixth FIFO unit is (Mx N/2). 

22. A motion vector estimating apparatus according to claim 20, further comprising: 

a fist adder (31) adding an output of said fifth FIFO unit and an output of said sixth FIFO unit; 
is a second adder (32) adding the output of said fifth FIFO unit and an output of the processing element in a last 

stage of odd-numbered cascade connection in said tower half; 

a third adder (33) adding the output of said sixth FIFO unit and an output of the processing element in a last 
stage of even-numbered cascade connection in said tower half; 

a fourth adder (34) adding the output of the processing element in the last stage of odd-numbered cascade 
20 connection in said lower half and the output of the processing element in the last stage of even-numbered 

cascade connection in said lower half; and 

a fifth adder (35) adding an output of said second adder and an output of said third adder. 

23. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
2S elements are cascade-connected in each of said N rows (N is an even number), wherein ones of said N rows in 

an upper half are cascade-connected via first FIFO units, each of which is provided for two adjacent rows, and 
wherein ones of said N rows in a lower half are cascade-connected via second Fl FO units, each of which is provided 
for two adjacent rows, and 

wherein said block matching circuit further includes a third FIFO units connected to the processing element 
30 in a last stage of cascade connection in said upper half. 

24. A motion vector estimating apparatus according to claim 23, wherein a number of stages in each of said first and 
second FIFO units is M, and a number of stages in said third FIFO unit is (M x N/2). 

35 25. A motion vector estimating apparatus according to claim 23, further comprising: 

a adder (35) adding an output of said third FIFO unit and an output of the processing element in a last stage 
of cascade connection in said lower half. 

26. A motion vector estimating apparatus according to any one of claims 13 through 18, wherein said processing 
40 elements are cascade-connected in each of said N rows (N is an even number), wherein odd-numbered ones of 

said N rows are cascade-connected via first FIFO units, each of which is provided for two adjacent odd-numbered 
rows, wherein even-numbered ones of said N rows are cascade-connected via second FIFO units, each of which 
is provided for two adjacent even-numbered rows, and 

45 27. A motion vector estimating apparatus according to claim 26, wherein a number of stages in each of said first and 
second FIFO units is M. 

28. A motion vector estimating apparatus according to claim 26, further comprising: 

a adder (35) adding an output of the processing element in a last stage of odd-numbered cascade connection 
so and an output of the processing element in a last stage of even -numbered cascade connection. 

29. A method of estimating a motion vector, comprising the steps of: 

determining estimation history from previously estimated motion vectors; 
55 determining a search window based on said estimation history, wherein said search window is composed of 

rectangular reference regions; and 

performing a block matching process to a current block and each of reference blocks of said search window 
to determine a motion vector. 
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30. A method according to claim 29, wherein at least one of a shape of said search window, a size of said search 
window and a position of said search window is determined based on said estimation history. 

31. A method according to claim 30, wherein said step of determining estimation history includes determining whether 
5 a number of previously estimated motion vectors exceeds a predetermined number, and 

wherein said step of determining a search window includes neglecting the previously estimated motion vec- 
tors to determine said size and shape of said search window based on said current block, when it is determined 
that the number of previously estimated motion vectors is less than the predetermined number. 

10 32. A method according to claim 30 or 31 , wherein said step of determining estimation history includes: 

determining whether a number of previously estimated motion vectors exceeds a predetermined number; and 
producing a 2-dimensional map in which each of said previously estimated motion vectors is mapped based 
on its horizontal value and the vertical value, when it is determined that the number of previously estimated 
is motion vectors exceeds the predetermined number, and 

wherein said step of determining a search window includes determining said size and shape of the search 
window based on said 2<Jimensional map. 

33. A method according to any one of claim 30 through 32, wherein said step of determining estimation history includes 
20 calculating an average vector of said previously estimated motion vectors, and 

wherein said step of determining a search window includes determining said position of said search window 
using said average vector. 

34. A method according to any one of claims 29 through 33, further comprising the steps of: 

25 

detecting either one of a bad of said motion vector estimating apparatus or a load of said block matching circuit; 
detecting a power supply voltage; and 

detecting a temperature of a chip on which said block matching circuit is installed, and 
wherein said step of determining a search window includes determining said search window based on said 
30 estimation history and at least one of the detected load, the detected power supply voltage and the detected 

temperature. 

35. A method according to any one of claims 29 through 34 : wherein said steps of: 

35 determining a search window includes determining at least an unuseful region in said search window in unit 

of pixels based on said estimation history; and 

setting a flag to each of pixels of said unuseful region such that said block matching process to said reference 
block which includes any pixel of said unuseful region is made invalid based on the flag of the pixel of said 
unuseful region. 

40 

36. A method according to claim 35, wherein said step of performing a block matching process includes performing 
said block matching process in a pipeline processing using processing elements of M columns and N rows (M and 
IM are positive integers). 

45 37. A method according to claim 36, wherein said step of performing said block matching process in a pipeline process- 
ing includes: 

in each of said processing elements, calculating an absolute value of a difference between a corresponding 
pixel data of said current block and one pixel data of each of said reference blocks; 
so adding the absolute value to an output from a previous stage of processing element; and 

outputting the adding result to a next stage of processing element. 

38. A method according to claim 34, wherein said calculating step includes, in each of said processing elements, 
latching the corresponding current block pixel data once the corresponding current block pixel data is supplied via 

55 said current block pixel data bus. 

39. A method according to claim 37, wherein said step of performing said block matching process in a pipeline process- 
ing includes, in each of said processing elements, performing the calculation of the absolute value of the difference 
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and the addition and to output the addition result to the next stage ol processing element in synchronous with a 
clock signal. 

40. A method according to claim 39, wherein said step ol performing said block matching process in a pipeline process- 
5 ing includes: 

calculating a likelihood of one of said reference blocks in a time period from a first time to (M x N) time; and 
calculating a likelihood of a different one of said reference blocks in a time period from a second time to (M x 
N + 1)time. 
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